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Abstract  -  This  p^ier  discusses  a  family  of  analytical  models  from  which  the  joint 
probat^ty  of  total  program  cost  and  schedule  can  be  emulated,  analyzed,  and  presented  to 
(tecision-makers.  Specifically,  the  classical  bivariate  nmmal  and  two  lesser  known  joint 
distributions,  the  normal-lognormal  and  the  bivariate  lognormal  distributions  are 
discussed.  Experiences  from  Monte  Carlo  simulations  suggest  that  this  family  of 
bivariate  distributions  are  candidate  models  for  computing  joint  and  conditional  cost  and 
schedule  probabilities.  In  particular,  the  discussion  on  the  normal-lognormal  distribution 
as  a  joint  cost-schedule  probabUity  model  extmids  re.searcb  on  the  applicability  of  the 

bivariate  lognormal  presented  last  year*. 

Joint  probability  distributions  enable  analysts  and  decision-makers  to  determine  joint  and 
conditional  probabilities  of  the  types 


P(Cost  ^  a  and  Schedule  ^  b) 
P(Cost  2  a|  Schedule  =  b) 


at  the  program  level.  Probability  statements  such  as  these  have  not  been  a  common 
product  of  cost  uncertainty  analyses,  and  the  development  of  analytical  methods  that 
produce  these  probabilities  is  an  advance  in  the  state  of  the  practice. 

General  formulas  for  the  correlation  functions,  conditional  distributions,  and  moments  of 
the  family  of  joint  probability  distributions  described  in  this  paper  are  provided.  Cost 
analysis  applications  are  presented  to  illustrate  the  use  rtf’  diese  distributions  in  a  practical 
context. 
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This  paper  proposes  a  family  of  analytical  models  from  which  the  joint 
probability  of  total  program  cost  and  schedule  can  be  described, 
analyzed,  and  presented  to  decision-makers.  Specifically,  the  classical 
bivariate  normal  and  two  lesser  known  joint  distributions,  the  normal- 
lognormal  and  the  bivariate  lognormal  distributions  are  discussed. 
Experiences  from  Monte  Carlo  simulations  suggest  that  this  family  of 
bivariate  distributions  are  candidate  models  for  confuting  joint  and 
conditional  cost  and  schedule  probabilities.  In  particular,  the 
discussion  on  the  normal-lognormal  distribution  as  a  joint  cost-schedule 
probability  model  extends  research  on  the  applicability  of  the  bivariate 
lognormal  presented  last  year^. 

Joint  probability  distributions  enables  analysts  and  decision-makers  to 
conduct  tradeoffs  on  joint  and  conditional  probabilities  of  the  types 

P(Cost  5  a  and  Schedule  ^  b) 

P(Cost  ^  a  I  Schedule  =  b) 

at  the  program  level.  Probability  statements  such  as  these  have  not 
been  a  common  product  of  cost  uncertainty  analyses,  and  the  development 
of  analytical  methods  that  produce  these  probabilities  is  an  advance  in 
the  state  of  the  practice. 

General  formulas  for  the  correlation  functions,  conditional 
distributions,  and  moments  of  the  fcutvily  of  joint  probability 
distributions  described  in  this  paper  are  provided. 
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A  Family  of  Joint  Probability  Models  for 
Cost  and  Schedule  Uncertainties 


Paul  R.  Garvey 

The  MITRE  Corporation,  Bedtbrd,  Massachusetts 

27th  Annual  Department  of  Defense  Cost  Analysis  Symposium 
September  1993 

Abstract  -  This  paper  discusses  a  family  of  analytical  models  from  which  the  joint 
probability  of  total  program  cost  and  schedule  can  be  calculated,  analyzed,  and  presented  to 
decision-makers.  Specifically,  the  cla.ssical  bivariate  normal  and  two  le.s.ser  known  joint 
distributions,  the  normal-lognormal  and  the  bivariate  lognormal  distributions  are 
discussed.  Experiences  from  Monte  Carlo  simulations  suggest  that  this  family  of 
bivariate  distributions  are  candidate  models  for  computing  joint  and  conditional  co.st  and 
schedule  probabilities.  In  particular,  the  dLscu.s.sion  on  the  normal-lognormal  disuibution 
as  a  joint  cost-schedule  probability  model  extends  research  on  the  applicability  of  the 
bivariate  lognormal  presented  last  year*. 

Joint  probability  distributions  enable  analysts  and  decision-makers  to  determine  joint  and 
conditiotuil  pro^ilities  of  the  types 

P(Cost  <,  a  and  Schedule  ^  b) 

P(Cost  <  a|  Schedule  =  b) 


at  the  program  level.  Probability  statements  such  as  these  have  not  been  a  common 
product  of  cost  uncertainty  analyses,  and  the  development  of  analytical  methods  that 
produce  these  probabilities  is  an  advance  in  the  state  of  the  practice. 


General  formulas  for  the  correlation  functions,  conditional  distributions,  and  moments  of 
the  family  of  joint  probability  distributioas  described  in  this  paper  are  pntvided.  Cost 
analysis  applicatioas  are  presented  to  illastraie  the  use  of  the.se  distribuiiiHis  in  a  practical 
context. 
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I.  Introduction 


When  cost  and  schedule  uncertainty  analyses  are  presented  to  decision-makers, 
questions  asked  with  increased  urgency  are:  What  is  the  likelihoiKl  of  achieving 
both  the  estimated  cost  and  schedule?  What  is  the  chance  of  exceeding  the  most 
likely  cost  for  a  given  schedule?  How  are  cost  reserve  recommendations  driven  by 
schedule  uncertainties?  Questions  such  as  these  are  not  readily  answered  by 
current  analysis  tools  and  techniques.  This  paper  advances  the  state  of  the  practice 
by  presenting  a  family  of  probability  models  designed  to  specifically  address  these 
and  related  questions. 

During  the  past  thirty  years,  uncertainty  analysis  methods  have  applied  univariate 
IM:obabiIity  theory  to  generate  separate  distributions  of  a  program's  estimated  cost 
and  schedule.  These  distributions  provide  decision-makers  with  probabilities  such 
as 

P(Cost^a)  (1) 

and 

P(Schedule^b)  (2) 

Although  it  has  long  been  recognized  that  cost  and  schedule  are  interdependent, 
little  has  been  adopted  from  multivariate  theory  to  combine  cost  and  schedule 
probability  distributions.  A  multivariate  model  that  combines  these  distributions 
would  provide  decision-makers  joint  and  conditional  probabilities  such  as 


P(Cost  ^  a  and  Schedule  <  b) 

(3) 

P(Cost  ^  a  1  Schedule  =  b) 

(4) 

In  i»^actice,  probabilities  in  (3)  and  (4)  are  of  greater  interest  to  decision-makers 
than  those  in  (1)  and  (2).  This  is  particularly  true  in  the  early  planning  phases  of  a 
{xogram  >^ere  critical  cost  and  schedule  decisions  are  made. 


This  paper  {U’esents  the  cost  analysis  community  with  a  family  of  multivariate 
probability  models  from  which  joint  and  conditional  distributions  of  program  cost 
and  schedule  can  be  developed.  Specifically,  the  classical  bivariate  normal  and  two 
lesser  known  Joint  distributions,  the  bivariate  normal-lognormal  and  the  bivariate 
lognormal  distributions  are  discussed.  The  development  of  the  bivariate  normal - 
lognormal  distribution,  and  its  application  in  the  cost  analysis  domain,  extends 
research  [11  on  the  utility  of  joint  probability  models  for  program  cost  and 
schedule.  Appendixes  are  provided  that  document  the  properties  of  the  bivariate 
normal-lognormal  and  bivariate  lognormal  models.  Properties  of  the  bivariate 
nomal  are  well  known  and  can  be  found  in  [2]. 

The  models  proposed  in  this  paper  are  analytical.  They  facilitate  computing  the 
probabilities  defined  in  (3)  and  (4).  Furthermore,  the  means  and  variances  from 
univariate  cost  and  schedule  distributions  developed  from  traditional  techniques  can 
be  used  to  specify  the  parameters  of  a  bivariate  probability  model  assumed  for  a 
given  program.  Thus,  no  new  or  additional  analysis  effort  is  required  to  define  and 
apply  these  models.  Lastly,  these  models  explicitly  incorporate  correlation  between 
cost  and  schedule  on  a  given  program  -  an  issue  not  well  addressed  in  current 
analysis  m^hodology. 

Formulas  are  developed  diat  combine  program  cost  and  schedule  probability 
distributions  to  produce  conditional  cost  distributions  for  a  given  set  of  schedules. 
For  example,  in  figure  1  the  curves  with  solid  lines  are  unconditional  distributions 
of  a  program's  cost  and  schedule.  They  represent  probabilities  of  the  types  in  (1) 
and  (2).  Formulas  are  provided  in  section  Ill  that  incorporate  statistics  from  these 
distributions  to  form  conditional  cost  distributions  for  a  given  set  of  schedules. 
The  curves  with  dashed  lines  in  figure  1  are  examples  of  conditional  cost 
distributions.  Conditional  distributions  represent  probabilities  of  the  type  defined 
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in  (4).  They  are  an  effective  way  of  directly  linking  a  given  schedule  to  cost 
recommendations  tied  to  a  specified  level  of  confidence. 
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Figure  1.  Illustrative  Probability  Distributions  of  Program  Cost  and  Schedule 


In  summary,  this  paper  advances  current  practice  by  offering  a  family  of 
multivariate  models  ttiat  produce  joint  and  conditional  probabilities  of  program  cost 
and  schedule.  Cost  analysis  t^ications  that  illustrate  the  mathematical  theory  are 
inesented. 


n.  JOINT  PROBABILITY  MODELS 
AND  THEIR  SUPPORT  TO  RISK  MANAGEMENT 

This  section  briefly  defines  the  domain  of  the  models  described  in  this  paper  and 
how  diey  provide  analyses  that  supports  risk  management  decisions.  An  overview 
of  the  risk  management  decision  space  is  discussed.  A  context  for  how  these 
models  are  defined  within  that  space  is  provided.  Finally,  the  types  of  uncertainties 
fiiat  can  be  quantified  by  these  models  are  discussed. 


A.  Models  and  the  Risk  Management  Decision  Space 


Cost,  schedule,  and  performance  objectives  define  the  domain  of  the  program 
manager's  decision  space.  A  characterization  of  this  space  is  presented  in  figure  2. 
Illustrated  is  how  user  expectations  on  cost,  schedule,  and  performance  are  often  at 
odds  with  what  can  actually  be  delivered.  Risk  is  introduced  when  expectations  in 
any  of  these  dimensions  push  what  is  technically  and/or  economically  feasible. 
Managing  risk,  then,  is  managing  the  inherent  contention  that  exists  within  each 
axis  and  across  all  three.  The  goal  of  risk  management  is  to  identify  cost, 
schedule,  and  pofoimance  risks  early  such  that  control  on  either  axis  is  not  lost,  or 
that  the  consequences  of  risk  mitigating  actions  on  all  three  axes  are  well 
understood. 


Performance 


Figure  2.  Models  and  the  Risk  Management  Decision  Space 


The  joint  (n^obability  models  presented  in  section  III  operate  in  two  of  the  three 
dimensions  shown  in  figure  2.  They  are  bivariate  in  the  cost  and  schedule  plane. 
When  iq)plying  these  models  it  is  assumed  that  a  system  architecture  has  been 
defined,  or  hypothesized,  by  the  engineering  team.  It  is  also  assumed  that  this 
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architecture  will  achieve  the  performance  requirements  as  they  are  specified  or 
understood  at  that  time. 


Thus,  for  a  given  system  architecture,  the  uncertainties  c^tured  by  these  models 
are  those  associated  with  its  technical  definition  and  resource  (cost-schedule) 
estimation  ap^n^oaches.  Examples  of  technical  definition  uncertainties  include 
estimating  the  amount  of  new  code  to  develop,  or  the  number  of  custom 
miaochips  to  design  and  fabricate,  or  the  delivered  weight  of  a  newly  designed 
antenna  pedestal.  Examples  of  resource  estimation  uncenainties  include  errors  in 
cost  models  and  cost  methodologies,  uncertainties  in  estimated  activity  durations, 
economic  uncertainties  as  they  affect  the  cost  of  technology  (e.g., 
inflation/deflation),  or  uncertainties  in  the  data  (e.g.,  labor  rates,  prcxluctivity  rates) 
used  to  develop  the  resource  estimates. 


Once  a  joint  cost-schedule  probability  model  has  been  developed  on  a  program, 
analyses  can  be  conducted  to  support  risk  management  decisions.  These  include 

Baselining  Program  Cost  and  Schedule  Risk  -  For  a  given  system 
architecture,  acquisition  strategy,  and  resource  estimation  approach 
baseline  {mibability  distributions  of  a  program's  cost  and  schedule 
can  be  developed.  These  distributions  can  be  periodically  updated  as 
the  program's  uncertainties  change  across  the  acquisition 
milestones.  Generating  these  distributions  supports  estimating  a 
program  cost  and  sch^ule  that  simultaneously  have  a  specified 
peccability  of  not  being  exceeded.  This  distribution  also  provides 
program  manage  widi  an  assessment  of  the  likelihood  of  achieving 
a  budgeted  or  {»'oposed  cost  and  schedule  or  cost  for  a  given 
schedule. 

Estimating  Reserves  -  An  analytical  basis  for  estimating  cost  reserve 
for  a  given  schedule,  or  set  of  schedules,  as  a  function  of  the 
uncertainties  specific  to  a  given  system  can  be  developed. 
Sensitivity  analyses  can  be  conducted  to  assess  how  reserve  levels 
are  affected  by  changes  in  specific  program  risks.  In  addition,  the 
relationship  between  the  amount  of  reserve  to  recommend  for  a 
given  level  of  confidence  and  a  given  schedule  can  be  examiml. 
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Conducting  Risk  Mitigation  Tradeoff  Analyses  -  Models  can  be 
developed  to  study  the  payoff  of  implementing  specific  risk 
amelioration  activities  (e.g.,  prototyping)  on  reducing  the  estimated 
cost  and  schedule  variances.  For  instance,  suppose  that  the  cost  and 
schedule  variances  of  a  program  are  driven  by  uncertainty  in  the 
amount  of  new  code  to  develop.  Using  these  models,  it  can  be 
determined  whether  investing  in  a  prototyping  effort  markedly 
reduces  this  variance,  and,  therefore,  lessen  the  cost  and  schedule 
les^es  estimated  to  the  program. 

m.  A  FAMILY  OF  JOINT  PROBABILITY  MODELS 
FOR  COST  AND  SCHEDULE 


This  section  presents  a  family  of  joint  probability  models  for  program  cost  and 
schedule.  Specifically,  the  bivariate  normal,  the  bivariate  normal-lognormal,  and 
the  bivariate  lognormal  models  are  described.  These  models  are  candidate 
theoretical  distributions  that  might  be  assumed  by  an  analyst  when  joint  or 
conditional  cost-schedule  probabilities  are  needed. 

The  models  described  herein  have  two  features  desirable  for  cost  analysis 
applications,  first,  they  directly  incorporate  correlation  between  cost  and  schedule 
on  a  given  program.  Second,  they  have  the  property  that  their  marginal 
distributions*  are  either  both  normal,  normal  and  lognormal,  or  both  lognormal. 
This  is  reflective  of  the  distributions  frequently  observed  in  Monte  Carlo 
simulations  [1,3]  of  program  cost  and  schedule. 

The  following  briefly  describes  these  models.  A  cost  analysis  application  is  also 
presented.  Throughout  the  remainder  of  this  paper  the  random  variables  X  i  and 
X2  will  denote  program  cost  and  schedule,  respectively. 

*  If  Xi  and  X2  are  random  variables  with  joint  probability  distribution  Fx,,x,  (^i  •’^2  )<  then  the 
marginal  distributions  of  Xi  and  X2  are  the  individual  probability  distributions  Fx^  (Xj )  and 
I^,(*2)’  respectively. 
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A  The  Classical  Bivariate  Normal  Model 

This  section  [H^esents  the  classical  bivariate  normal  and  summarizes  its  major 
characteristics.  An  important  feanire  of  this  model  are  its  marginal  distributions, 
which  are  both  normal. 

In  cost  analyses,  normal  distributions  can  arise  when  program  cost  is  the  sum  of 
many  uncorrelated  WBS  cost  element  random  variables.  Normal  distributions  can 
also  occur  in  schedule  analyses.  For  instance,  program  schedule  is  approximately 
normal  if  it  is  the  sum  of  many  independent  activities  in  a  network.  Thus,  if 
normal  distributions  characterize  a  program's  cost  and  schedule  distributions,  then 
the  bivariate  normal  could  serve  as  an  assumed  model  of  their  joint  distribution. 

A.I  Model  Definition 

Suppose  we  have  two  random  variables  Yj  =  X|  and  Y2  =  X2  where  X]  and  X2 
are  defined  on <««o  and -<»><X2  <«>.  If  Y  ]  and  Y  2  each  have  a  normal 
distribution  then,  for  i  -1, 2.  the  mean  and  variance  of  Yi  are 

E(Yi)  =  Py,  =1^1  (5) 

E(Y2)  =  4y2  =1^X2  (6) 

Var(Y,)  =  4,  =4,  =0?  (7) 

Var(Y2)  =  OY2  =fyx2  (8) 

Assume  that  the  pair 

(X1.X2)- Bivariate  N((pi,p2).(<yi.<y2.Pi.2))  (9) 

is  a  bivariate  normal  distribution  with  density  function  [2] 
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fXi.X2(*l>*2) 


_ 1 

•  (27t)0i02Yl^f2 

0  otherwise 


<Xj  <»,  — '»<X2<<” 


(10) 


where 


w  = 


'-Pu 


.  <^1  J 


-2pj 


^Xi-HiYx2-il2 

\  a,  X  02 


02 


and  jij  and  of  (i  =  1,  2)  are  given  by  equations  (5)  through  (8).  The  correlation 
term  2  equation  (10)  is 

Pu  ~Py,.Y2  “PX1.X2  (11) 

If  two  continuous  random  variables  X]  and  X2  have  a  bivariate  normal 
distribution,  th^n 

P(ai  ^ Xi  ^ bi  and  a2  ^  X2  ^  b2) 


is  given  by 

fb2  fb| 

1  J  fx,.X2(’^l*X2)dXidX2 

*'82  *'81 

where,  in  this  case,  f  is  defined  by  equation  10. 


(12) 


AM  Model  Characteristics 


A  diaractoistic  of  the  bivariate  normal  distribution  is  that  the  distribution  of  Xi 
and  the  distribution  of  X2  are  each  univariate  normal.  These  are  the  marginal 
distributions  and  they  are  given  by 


(13) 
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(14) 


In^XHtant  tradeoffs  in  cost  analysis  often  involve  assessing  the  impact  that  a  given 
set  of  schedules  have  on  the  likelihood  that  program  cost  will  not  exceed  a 
in’escribed  threshold.  To  make  these  assessments,  the  conditional  probability 
distribution  is  needed.  Conditional  distributions  provide  probabilities  of  the  type 

P(Xi<a|X2  =b) 

If  two  continuous  random  variables  Xi  and  X 2  have  a  bivariate  normal 
distribution,  then  the  conditional  probability  density  function  of  Xi  given  X2  =  X2. 

denoted  by  fxi|x2  ^*1^’  distributed  [2].  That  is 


Xi|x2  -N(Pi+— ^Pl_2(X2  -M2)-®?(^”P?.2)) 

O2 

(1.S) 

similarly 

X2|x|  ~N(P2+^Pi.2(Xi  ~Pi).02(^“P?,2)) 

®1 

(16) 

From  (IS)  and  (16)  the  conditional  means  and  variances  are 

E(Xi|x2)  =  11i+^Pi,2(*2  “P2) 

O2 

(17) 

E(X2|xi)  =  P2+^Pi.2(xi  “Pi) 

(18) 

and 

Var(Xi|x2)  =  a?(l-pf2) 

(19) 

Var(X2|xi)  =  ai(l-p?.2) 

(20) 

Section  D  will  illustrate  how  conditional  distributions  are  developed  to  provide 
decision-makers  with  cost-schedule  probabilities  such  as 


P(Cost  <  a|  Schedule  =  b) 
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B.  A  Bivariate  Normal-LogNormal  Model 


This  section  presents  a  bivariate  nonnal-lognormal  nuxlel  and  summarizes  its  major 
characteristics.  Unlike  the  classical  bivariate  normal,  very  little  exists  in  the 
statistical  literature  on  this  model.  Furthermore,  the  few  references  [4,.S]  that 
discuss  the  bivariate  normal-lognormal  define  it  by  parameters  not  computed  in 
cost-schedule  uncertainty  analyses.  Thus,  an  alternative  form  of  the  model  than 
that  offered  in  [4,5]  was  created  specifically  for  this  application  context.  Its  form 
is  a  direct  extension  of  the  bivariate  normal  presented  in  .section  A.  A  theoretical 
discussion  of  this  model  is  provided  in  appendix  A. 

An  important  feature  of  the  bivariate  normal-lognormal  model  is  its  marginal 
distributions.  One  marginal  is  normal  and  the  other  is  lognormal  (refer  to  appendix 
A).  Section  A  described  circumstances  that  give  rise  to  normal  marginal 
distributions  for  cost  or  schedule.  Under  certain  conditions,  lognormal 
distributions  can  characterize  simulated  program  cost  and  schedule  distributions 
[1,3].  For  instance,  it  is  frequently  observed  that  the  lognormal  approximates  the 
simulated  distribution  of  program  cost  if  it  is  the  sum  of  many  positively  correlated 
cost  element  random  variables.  Similarly,  program  schedule  can  also  tend  towards 
lognormality  if  it  is  the  sum  of  many  positively  correlated  schedule  activities. 

These  observations  stem  from  practical  experiences  with  Monte  Carlo  simulations. 
They  are  not  theoretical  findings.  Exceptions  to  these  observations  can  be  created. 
However,  if  a  situation  arises  where  normal  and  lognormal  distributions 
characterize  a  program's  cost  arid  schedule  distributions,  then  the  bivariate  normal- 
lognormal  could  serve  as  an  assumed  model  of  their  Joint  distribution. 
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B.I  Model  Definition 


Suj^se  we  have  two  random  variables  Yj  =  and  Y2  =  lnX2  where  Xi  and 
X2  are  defined  on -00  <  X]  <  00  andO<X2<<».  If  Y 1  and  Y  2  each  have  a 
normal  distribution  then,  for  i  =1. 2,  the  mean  and  variance  of  Yi  [1]  are 

E(Yi)  =  jiy,  =lfx,  =Mi  (21) 

Var(Yi)  =  ai,  =0^,  =0?  (22) 


E(Y2)  =  |1y2  =42  =  jln 
Var(Y2)  =  OY2  =  O2  =  •” 


(MX,/ 

(MX2/+OX2 

(Mx2/-*-gX2 

(Mxj/ 


Assume  that  the  pair 

(Xi,X2)  -  Bivariate  NLogN((iii.M2  ).(Oi  .O2 .  p,  2  » 
is  a  bivariate  normal-lognormal  distribution  with  density  function 


(23) 

(24) 


(25) 


fXi,X2(’^l’*2)  =  ] 
vriieie 


(2jl)Oia2)/l-pf  2  *2 
0  otherwise 


i  —00  <  X|  <00  ,  {) 


<X2<~  (26) 


w  = 


1 


1-P?,2 


Xi-Mi 


K  k  02 


^  ^lnx2  -li2  ^ 
02 


and  Pi  and  (i  =  1,2)  are  given  by  equations  (21)  through  (24).  From  appendix 
A  (theorem  A-1)  the  coirdation  term  Pj  2  in  equation  (26)  is 


(27) 


Pl.2  *Py,.Y2  -PXpInXj  -Pxj.Xj  02 

Equation  27  can  be  found  in  P.  T.  Yuan  (1933)  [6],  Proofs  have  been  developed 
by  P.  H.  Young*  and  Ihe  author  (refer  to  appendix  A). 


If  two  continuous  random  variables  Xi  and  X2  have  a  bivariate  normal-lognormal 
distribution,  then 

P(ai  ^ X,  < bi  and 32  ^ X2  ^  bj) 


is  given  by 


J  I  fx,.X2(*l.*2)dx,dX2 

•'“a  ■'»! 


(28) 


where,  in  this  case,  f  is  defined  by  equation  26 
B.II  Model  Characteristics 


A  characteristic  of  the  bivariate  normal-lognormal  distribution  is  that  the 
distribution  of  Xi  is  normal  and  the  distribution  of  X2  is  lognormal.  Tliese 
marginal  distributions  are  given  by 


fi(xi)=- 

f2(X2)  = 


1 

/2jtai 

1  “1*2  ] 
O2X2 


(29) 

(30) 


P.  H.  Young  [The  Aerospace  Corporation,  1992]  developed  a  proof  of  (27).  His  approach 
used  the  dehnition  of  EfXjXj)  to  determine  pi  The  proof  of  (27)  presented  in  i^>pendix  A  is  a 
slight  variation  of  Young's  approach.  The  formula  for  pi,2  is  esteblished  directly  from  the 
definition  of  the  Cov(XiX2)  instead  of  E(X|X2).  Recall  that  Cov(X|X2)  =  E(XiX2)-E(Xi)E(X2). 
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The  conditional  probability  density  ftinction  of  X]  given  X2  =  X2.  denoted  by 
^Xi|x2  itormally  distributed  (refta^  to  appendix  A,  theorem  A-3).  That  is 

Xi(x2  ~ N(|ii+ — ^Pi.2(lnx2 -it2)>®?(l“Pi.2))  (30 

02 

staiilarly  X2IX1  -LogN(P2+^Pi2(*i“i*i)-O20-P?.2)) 

From  s^ipendix  A  (theorem  A-4)  the  conditional  meaas  and  variances  are 

E(Xi|x2)  =  Pi+^p,2(lnx2-M2) 

02 

M  2  +^Pl.2  ( *1  -P 1  )+io?  ( 1-p?  2 ) 

E(X2|xi)  =  e 
and 

Var(Xi|x2)=o?(l-p?.2) 

2(P2+^Pl,2(xi-i‘l)) 

Var(X2|xi)  =  e  e^e^-1) 

wherfe  z  =  02(l-Pi,2) 

Section  D  will  illustrate  how  conditional  distributions  from  the  normal-lognormal 
model  are  developed  to  provide  decision-makers  with  cost-schedule  pobabilities 
such  as 

P(Cost  ^  a|  Schedule  =  b) 

C.  The  Bivariate  LogNomud  Model 

Tliis  section  presents  the  bivariate  lognormal  model  and  summarizes  its  major 
characteristics.  Research  pesented  last  year  [1]  demonstrated  that,  undo^  certain 
conditions,  this  model  can  approximate  a  program's  cost-schedule  joint  and 


(32) 

(33) 

(34) 

(35) 

(36) 
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conditional  probaUlities.  The  bivariate  lognormal  has  a  number  of  characteristics 
desirable  for  cost  analysis  applications.  Among  these  are 


•  It  is  bounded  by  zero  (cost  and  schedule  cannot  become  negative), 

•  Its  marginal  and  conditional  distributions  are  lognormal,  which  is 
frequently  observed  in  Monte  Carlo  simulations  of  program  cost  and 
sch^le  [1.3]. 


C.I  Model  Definition 


Suppose  we  have  two  random  variables  Y|  =  In  X|  and  Y2  =  In  X2  where  X]  and 
X2  are  defined  on  0  <  Xj  <  00  andO<X2<‘“.  If  Yi  and  Y2  each  have  a  normal 
distribution  then,  for  i  =1, 2.  the  mean  and  variance  of  Yj  [  1  ]  are 


1 


E(Yi)  =  PYi  =4i=rln 


2 

Var(Yi)  =  oi.  =o?=lnj 


(MXj)  +qx, 


(Mx^r 


Assume  ttiat  the  pair 

(Xi  ,X2 )  -  Bivariate  LogN ((Pi . ]i2 )» -o  2 .  Pi, 2  » 
is  a  bivariate  lognormal  distribution  with  density  function 

»  A 


%,X2  = 


(2n)Ci02^l-pl2^lX2 


0  otherwise 


where 
w  = 


(37) 


(38) 


(39) 


(Xxi  <«»,() <X2  <«»  (40) 


'inxi-pif  „  f  Inxi  -Pi  Y lnx2  ~P2  1,  f  >nx2  -P2 
.  oi  j  I  ^,2  r[  02 
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and  iij  andOj  (i  =  1,  2)  are  given  by  equations  (37)  and  (38).  From  af^ndix  B 
(flieorem  B-2),  the  correlation  term  Pj  2  in  equation  (40)  is 


1  . 

Pi  2= - 

‘•2  0,02 


(41) 


If  two  continuous  random  variables  Xi  and  X  2  have  a  bivariate  lognormal 
distribution,  then 

P(ai  <  Xi  <  bi  and  aj  <  X2  <  b2) 

is  given  by 

T' fx,.X2(*i-*2)dXidx2  (42) 

^32  •'aj 

where,  in  this  case,  f  is  defined  by  equation  40. 

CM  Model  Oiamcteristics 

A  characteristic  of  the  bivariate  lognormal  distribution  is  that  the  distribuoon  of  X 1 
and  the  distribution  of  X2  are  each  lognormal.  These  marginal  distributions  are 
given  by 


y2lt  0|  X| 

(43) 

^  ^jln  02X2 

(44) 

The  conditional  probability  density  ftiiKtion  of  X 1  given  X2  =  X2.  denoted  by 
^Xi|x2  iosnonnaUy  distributed  (refer  to  appendix  B,  theorem  B-4).  That  is 

Xi|x2-LogN(pi+-^pi,2(lnx2-P2).Oi(i-Pi.2))  (^5) 

02 
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(46) 


similarly 

X2I  xi  -  LogN(P2  +  ^  Pudn  -  M 1 ) .  -  Pu ) ) 

(46) 

From  appendix  B  (dieorem  B-5)  the  OMiditiona]  means  and  variances  are 

E(Xi|x  j)  =  ’ 

(47) 

E(X2|x,)  =  = ' 

(48) 

and 

Var(X,|xj )  =  X e""*' ’e"’  (e"’  - 1) 

(49) 

Var(X2|x,)= X  V(e'  - 1) 

(50) 

\^iere 

z  =  02(1-Pi.2)  and  z*  =  o?(l-p?.2) 

Section  D  will  illustrate  how  conditional  distributions  from  the  bivariate  lognormal 
model  ate  developed  to  provide  decision-makers  with  cost-schedule  probabilities 
such  as 

P(Cost  ^  a|  Schedule  =  b) 

D.  A  Cost  Analysis  Application 

The  following  illustrates,  from  a  cost  analysis  perspective,  how  the  joint 
probability  modds  described  in  this  paper  can  be  defined  and  applied.  Recall  that 
ttK  random  variables  X  i  and  X2  denote  program  cost  and  schedule,  respectively. 

Suppose  that  the  cumulative  probability  distributions  of  a  program's  cost  and 
sdiedule  are  given  in  figure  3.  Furthermore,  suppose  that  these  distributions  have 
means  and  variances  given  in  table  1  and  that  the  correlation  between  program  cost 
and  schedule,  Px,,X2  • 
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Rgure  3.  Probability  Distributions  for  Program  Cost  and  Schedule 


Table  1.  Program  Cost  and  Schedule  Statistics 


i 

Xi 

MXi 

< 

OXi 

1 

Cost($M) 

100 

625 

25 

2 

Schedule  (Months) 

48 

36 

6 

Correlation  px,,x2 

0.50 

Using  these  data,  the  param^ers  of  the  three  joint  probability  models  described  in 
ttiis  papa  can  be  d^ennined.  Hiese  parameters  are  summarized  in  table  2. 

Ftom  table  2  and  (9),  (25),  and  (39)  we  can  write 

(Xi,X2)  -  Bivariate  N((100,48),(625, 36,0.50))  (51) 

(Xi,X2)- Bivariate  NLogN((100,3.86.3),(625.0.0155,0.502))  (52) 

(X,,X2)  ~  BivariateLogN((4.575,3.863),(0.0606, 0.0155, 0.506))  (53) 
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if  ttiepair  (Xj.X^)  is  an  assumed  bivariate  normal,  an  assumed  bivariate  normal- 
lognormal.  or  an  assumed  bivariate  lognormal. 

Table  2.  Joint  Probability  Model  Parameter  Set 


i 

of 

Oi 

Bivariate  Normal  Model 

1  100r£9t5> 

625  (Eqt  7) 

25 

2  4iiEqt6) 

36  (Eqt  8) 

6 

Correlation  pi_2 

0..5()  (Eqt  11) 

Bivariate  Normal-LogNormal  Model 

1  100  (Eqt  21) 

625  (Eqt  22) 

25 

2  3.863  (Eqt  23) 

0.0155  (Eqt  24) 

0.124 

Cotrdation  pi_2 

0.502  (Eqt  27) 

Bivariate  LogNonnal  Model 

1  4.575  (Eqt  37) 

0.0606  (Eqt  38) 

0.246 

2  3.%63  (Eqt  37) 

0.0155  (Eqt  38) 

0.124 

Correlation  Pi  2 

0.5nf>  (Eqt  41) 

Once  the  joint  distribution  of  (Xi,X2)  has  been  specifled,  joint  cost-schedule 
probabilities  can  be  computed.  For  instance,  if  (Xi,X2)  is  assumed  to  be 
bivariate  ncvinal-lognormal  with  parameters  given  in  (52),  then  the  probability 

P($100M  ^  X,  ^  $1 30M  and  45  months  <  Xj  ^  55  months) 
is 

f55  fl30 

fx,.X2(xl.X2)dxidX2  =0.245 

»45  •100 
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In  this  case,  the  integrand  (Xj.Xj  (xi.*2)  is  defined  by  equation  26.  Joint  cost- 
schedule  probabilities  for  other  regions  of  interest  are  determined  in  a  similar 
manner. 


D.I  Conditional  Cost-Schedule  Distributions 

Cost  analyses  must  often  focus  on  assessing  the  impact  that  a  given  set  of 
schedules  have  on  the  likelihood  that  an  estimated  program  cost  will  not  be 
exceeded.  To  make  these  assessm^ts  the  conditional  distribution  of  program  cost 
is  needed.  Stated  previously,  conditional  distributions  produce  probabilities  of  the 
type 

P(Cost  <  a|  Schedule  =  b) 

Computing  a  conditional  probability  requires  the  joint  cost-schedule  probability 
distribution.  Recall  the  follovtdng  joint  cost-schedule  distributions  formed  from  the 
datain  table  1. 

(Xi,X2)  -  Bivariate  N((100,48),(625, 36,0.50)) 

(Xi,X2)  -  Bivariate  NLogN((100,3.863),(625,0.0155.0.502)) 

(Xi,X2)  -  Bivariate LogN((4.575,3.863),(0.0606,0.0155,0.506)) 

From  equations  15,  31,  and  45,  and  table  2,  the  conditional  distributions  of  cost 
Xi|x2  for  a  given  schedule  X2  can  be  formed.  In  this  case,  the  conditional 
distribution  of  Xi|x2  if  the  pair  (X|,X2)  is  an  assumed  bivariate  normal,  an 
assumed  bivariate  normal-lognormal,  or  an  assumed  bivariate  lognormal  are, 
respectively 


Xi|x2  -  N(100  +  2.083(X2  -  48), 468. 75 )  (54) 

Xi|x2  -  N(100-H01.21(lnx2  -3.863), 467.5)  (55) 

Xilx2  ~ LogN(4.575  +  1.0038(lnx2  -3.863), 0.045)  (56) 
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Using  the  lognormal  distribution  given  by  (56),  conditional  cumulative  cost 
distributions  given  X2  equal  to  50, 55,  and  60  months  are  shown  in  figure  4.  Table 
3  summarizes  the  statistics,  computed  from  theorems  B-4  and  B-5  (refer  to 
appendix  B),  of  these  distributions. 

From  figure  4,  the  impact  of  a  given  schedule  on  the  probability  distribution  of  cost 
can  be  determined.  Observe  that  as  x  2  increases,  the  cumulative  conditional  cost 
distributions  become  "lazier."  Shown  in  table  3,  the  increased  laziness  reflects  a 
growth  of  nearly  2  million  dollars  in  o(Xi|  X2)  for  each  5  month  increase  in  X2. 


Figure  4.  A  Family  of  Conditional  LogNormal  Cost  Distributions 
(Equation  56  with  X2  =  50,  55,  60  months) 

By  definition,  the  conditional  median  cost  occurs  with  probability 
P(Cost  ^  a|  Sctedule  =  X2 )  =  0. 50 
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where,  in  table  3,  a  is  equal  to  102, 1 12,  and  122  million  dollars  given  X2  equal  to 
50, 55,  and  60  months,  respectively.  This  reflects  the  cost  that  is  equally  likely  to 
overrun  or  underrun  for  a  given  schedule  x,.  For  this  specific  example, 
Median(Xi|  X2)  increases  by  10  million  dollars  for  every  5  month  increase  in  X2. 
A  similar  increase  is  seen  in  the  conditional  mean  cost  E(Xi|  X2 ). 

Table  3.  Statistics  from  the  Conditional  Distributions  in  Figure  4 

Givenx2  Median(Xi|x2)  E(Xi|x2)  a(Xi|x2) 

(Months)  ($M)  ($M)  ($M) 


50 

102 

104 

22.4 

55 

112 

115 

24.6 

60 

122 

125 

26.9 

Linkages  such  as  these  between  cost  uncertainty  and  schedule  can  be  made  through 
the  use  of  conditional  distributions.  In  the  early  stages  of  a  program,  conditional 
distributions  provide  decision-makers  valuable  insight  into  the  likelihood  of 
achieving  cost  and  schedule  goals. 

rV.  SUMMARY  COMMENTS 

The  family  of  joint  probability  models  described  in  this  paper  provides  an  analytical 
basis  for  computing  joint  and  conditional  cost-schedule  probabilities.  Selection  of 
a  particular  model  is  guided  by  the  marginal  distributions  it  produces.  For 
example,  if  the  individual  distributions  of  Xi  and  X2  are  observed  to  be  normal 
and  lognormal,  then  the  bivariate  normal-lognormal  model  (25)  might  be  chosen 
for  die  joint  distribution  of  the  pair  (X^ ,  X2 ).  This  is  because  the  bivariate  normal- 
lognormal  model  produces  normal  and  lognormal  marginal  distributions. 
However,  it  must  be  viewed  that  choosing  this  particular  model  makes  an 
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assiunption  about  how  the  pair  of  random  variables  (X1.X2)  is  jointly  distributed. 
The  true  joint  distribution  of  (Xj ,  X2 )  cannot  be  uniquely  determined  from  only  the 
individual  distributions  of  Xi  and  X2. 

A  parameter  required  by  the  models  in  this  paper  is  the  correlation  between 
program  cost  and  schedule.  Although  this  is  a  difficult  parameter  to  estimate, 
approaches  include  deriving  it  from  an  historical  cost-schedule  database  (one  such 
correlation  coefficient  can  be  found  in  reference  1)  or  computing  it  using  values 
sampled  from  a  simulation  of  the  cost-schedule  estimating  relationships  established 
for  a  particular  program.  Subjective  assessments  might  be  used  in  the  absence  of  a 
database  or  a  cost-schedule  simulation  model. 

An  important  consideration  regarding  the  models  herein  is  that  they  do  not  reflect 
the  causal  impact  that  schedule  compression  or  extension  has  on  .ost.  These 
models  treat  cost  and  schedule  as  correlated  random  variables  whose  range  of 
values  are  reflected  by  their  marginal  distributions.  These  ranges  result  from 
quantifying  the  uncertainties  associated  wiui  a  specific  technical  baseline  and  the 
cost-schedule  estimation  approaches.  Unrealistically  compressing  or  extending 
schedule  (due  to  missed  milestones  or  program  re-plans)  can  incur  increased  cost. 
In  these  circumstances  a  reassessment  of  the  system's  cost-schedule  risk  is 
warranted. 

The  utility  of  joint  probability  models  is  enabling  analysts  to  present  decision¬ 
makers  with  an  integrated  view  of  cost  and  schedule  uncertainties.  In  developing 
the  model  parameters  for  a  specific  system,  elements  that  significantly  contribute  to 
cost  and  schedule  risk  are  identified.  This  fosters  early  recognition  of  inadequately 
^lecified  requirements  and  permits  risk  mitigating  decisions  to  be  made  early  in  the 
system  definition  phases.  For  the  program  manager,  joint  probability  models 
reveal  the  simultaneous  cost  and  schedule  impacts  of  uncertainties  in  such  areas  as 
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technical  realism,  the  soundness  of  the  acquisition  approach,  the  reasonableness  of 
the  schedule,  and  the  accuracy  of  the  cost  and  schedule  estimation  methodologies. 

In  an  environment  of  limited  funds  and  increasingly  challenging  schedules,  it  is 
incumbent  upon  analysts  to  continually  examine  affordability  concerns  relative  to 
the  likelihood  of  jointly  meeting  cost  and  schedule  P(Cost  <  a  and  Schedule  <  b) 
and  cost  for  a  given  schedule  P(Cost  <  a|Schedule  =  b)  against  specific  tradeoffs 
in  system  requirements,  acquisition  strategies,  and  post-development  support. 
Enabling  options  to  be  explored  that  offer  ctecision-makers  economically  sound  and 
risk  mitigating  choices  throughout  the  life  of  a  program  is  the  cost  analyst's  aim 
and  opportunity.  Models  and  methodologies  are  tools  that  provide  a  means  to  that 
end. 
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Appendix  A 

Theoretical  Aspects  of  the 
Normal-LogNormal  Joint  Probability  Distribution 


Let  =  Xj  and  Y2  =  lnX2  whore  X]  and  X2  are  random  variables  defined  on 

andO<X2<“.  If  Yi  and  Y2  each  have  a  nwmal  distribution  then 

E(Yi)  =  Py,  =Px,  =41 

Var(Yi)  =  o^j  =a^,  =0? 


E(Y2)  =  4y5  =42=rH 


Var(  Y2 )  =  o  Y,  =  02  =  In 


(4X2)^‘‘'®X2  j 


(4x2)^ +0X2 


(4x2) 


Assume  that  the  pair 

(X1.X2)  -  NLogN(ai,.P2).(a?.ai  Pi  2)) 
is  a  bivariate  normal-lognormal  distribution  with  density  ftinction 


fXj.Xj  (*1'*2)  = 


(2K)0i02'^l-pf2  X2 
0  otherwise 


-4w 

.  2  -00  <Xi  <«>  ,  0<X2  <~ 


iK^ieie 


P1.2  ~Py,.Y2  “PXi.lnX2 


A-1 


1 _ xi-Mi 


Pu[V  <^1 


Inx2-M2  ]||  lnx2-^; 

®2  y  I  ®2 


Theorem  A-1.  If  (X1.X2)  ~  NLogN((»i,.M2).(Oi  .<^2*  Pi,2)) 


Proof. 

By  definition 


(e°2  -D* 

Pi.2=Px,.X2  02 


Cov(Xi.X2)  _  ^XiX, 

C1.X2  Ox, Ox,  Ox, Ox, 


where 


OXiXj  =J^  j  (Xl”Pl)(X2-M2)fx,.X,(X|.X2)dx,dX2 


OXj  =Oi 

Since  X2  is  lognonnal,  we  have  from  table  B-1  (appendix  B) 
0x2  -1))*'"  =E(X2)(e"2 


®X,X2 


®X,X2 


X,.X2  Ox, 0X2  a,E(X2Xe‘'2 


To  con^Hite  Ox,x2 


,,=iUil.an<l  .,  =  ‘”’■8-1*; 
o,  02 


Ox,X2  = - j"  2  r  i 

2x^1 -pf.2  —  — 

= - .—  f  (Oltl){ll-P2*2l*l 

2kV*'Pu  - 


i»2+02t2  _  V  2(1-Pi2) 


T— (‘?-2Pi.2t,t2+t2) 


where 


Ii=f  e>'2+02‘2e  2(1-Pi.2) 


— -(tj  -2Pi_2tll2 ■*■*2  ) 


To  determine  Ij  note  that  the  integrand  can  be  written  as 


11=6^^21  e  £ 


Letting 


and  noting  ttiat 


we  can  write 


A  =  A(ti )  =  pi,2ti  +  (1  -  Pi,2  ><^2 


t^-2At2=(t2-A)2-A2 


- t?  - ^ - (tj-A)^ 

Ii=e^'2e  J  e  < 


_ !_t2  _ L_a2  _ 

Ii=e»‘2e  e2(i-p?.2) 


A-3 


fXi.X2  (^1**2)  = 


vtiheie 


and 


Q(Xi.X2)  = 


|>^(02^-P?.2)X2 


-(toX2-h)^/2o^(l-pf2) 


b  =  42‘*‘^Pl2^’^l  -Pl) 

<Tl  •“ 


Hierefore 


f.(x.)=j; 


1  -(x|-pi)2/2a? 


[Q(Xi.X2)dX2 


e  •  •  ■  •  •  ^|^Q(xj,X2)dx2 


tnoi 

7275;^® 


(A-2) 


since  the  integrand  is  the  |xobability  density  hinction  of  a  LogN(b,a2(*  -Pu)) 
random  variable. 


To  compute  f2(X2 ) ,  the  density  ftinction  fxj  .X2  '  ^^2 )  '^  factored  as 


fX|.X2(^l'*2)  =  Q  (*1.X2) 


1 _ I_  -(lnX2-P2)^/20? 

JW2  *2 


a 


where 


and 


Hierefore 


Q  (Xl.X2)  = 


^-(xj-l>’)^/2of(l-pj2) 


b*=p,+^p,2(lnx2-P2) 


(A-3) 


A-6 


j  Q*(x,,x2)dx, 


1^102  X2 
1  _Lp-<‘*‘*2->*2)^'2o^ 


^m2  X2 

1  ^-^|(toX2-M2  )^/Of] 

yjlK  02  X2 


since  the  integrand  is  the  (vobability  density  function  of  a  N(b*,af  ( 1  -  Pt2  )) 
random  variable. 


Theorem  A-3.  If  (Xi,X2)- NLogN((4,,^2).(<y?.02.Pl  2)) 

Xi|x2  ~  N(Hi  +— ^pi  2(lnx2  (l~Pi,2)) 

02 

X2  |xi  -  LogN(P2  +  ^  Pl.2(Xl  -  Pi )  •  O2  ( 1  -  p?.2  )  ) 
<^1 


Proof: 

By  definition,  we  have 


_-i[(lnx,-u,  )Vo/) 


O2X2 


fQ  (X1.X2) 


V2ji  02 


1 


*2 


thus,  from  A-3 

Xi|x2  -NIb’.afO-pf  j)) 

vliere 

b*=pi+^p,2(Inx2-M2) 

02 

Similarly 


A-7 


4((x,-U,  )2/a2j 


fl(Xl) 


^2noi  * 


Q(Xi.X2) 

I 

-i  (*1-Mi)^/0,2 


thus,  from  A-2 
uliete 


^XjIx,  (*2)-Q(Xi.X2) 
Xjjx,  -LogNCb.o^d-pf  j)) 


Theorem  A-4.  If(Xi.X2)-NLogN(0i,.p2).(<Tf.(y^2-Pi.2))  then 


Var(X2|xi)  =  e 


2(H2+^Pj.2(Xj-h,)) 


e^e*-l) 


E(Xijx2)  =  Pi  +-2l-pj  -P2) 

02 

Var(X,)x2)  =  o^(I-pf2) 

where  z  =  0^(1 -PJ2) 

Proof: 

Theorem  A-3  proved  that 

Xzlxi  ~LogN(42+^Pi.2(xi-Hi),a^(l-pf2)) 

therefore,  from  table  B-1  (^jpendix  B) 


A-8 


^l2+^Pl.2<*l-MlH+02(l-Pl  2) 

E(X2lxi)  =  e 

2{H2+|^P1.2(*1-Pi)) 

Var(X2|xi)  =  e  e*(e^-l) 

where  z  =  a2(l-pi.2). 

Tlieorem  A-3  proved  tiiat 


Xi|x2  -  N(pi  +-^pi  2(!nx2  -M2) -ofd  -Pu) ) 

O2 

therefore,  it  follows  immediately  from  the  properties  of  the  normal  that 


E(Xi|x2)  =  Mi  +-^P,  2(10x2  -M2) 
02 


and 

Var(Xi|x2)  =  o?(l-pj2) 


Theorem  A-5.  If  (Xi,X2)~NLogN((Mi.M2)*(<Ji>0  2’Pi.2)) 

H2+^Pl 

Median(X2|xi)  =  e 

,  P2+?P,2(*l-Pl>-®2<‘~pf2> 

Mode(X2lxi)  =  e 

Median(Xi  I  X2)  =  E(Xi|x2) 

Mode(Xi|x2)  =  E(Xi|x2) 

Proof: 

Since  X2I  Xj  is  lognormally  distributed  (theorem  A-3),  from  table  B-1  (appendix 
B) 


Median(X2|xi)  =  e"  =e 


M2+^Pi2<X|-Pi) 

CTi  *** 


and 


A-9 


M2+^Pl  ->(Xl-Pl  )-0?(l-pf-. ) 
Mode(X2lxi)  =  e  “ 

Since  Xi|  X2  is  normally  distributed  (theorem  A-3).  it  follows  immediately  that 

Median(Xilx2)  =  E(X,|x2) 

Mode(Xi|x2)  =  E(X,|x2) 

Property  A*!.  If  (Xi.X2)~NLogN((p,.ii2).(o?.02.Pi.2)) 

E(Xi|Median(X2|Mi))  =  p, 

Proof: 

From  theorem  A<4  it  was  established  that 

E(Xi|x2)  =  Hi  +^p.  ,(10X2  -M2) 

02 

From  theorem  A-5,  we  may  write 

Median(X2|x|  =M|)=e'^2 

It  follows  that 

E(Xi|Median(X2|li,))  =  E(X,|e»^2)=p,  +^pj  ^(Ine^'z  -M2) 

=  Mi+|^Pi2(P2-M2) 

=  Mi 


A-IO 


Appendix  B 

Theoretical  Aspects  of  the 

LogNormal  and  Bivariate  LogNormal  Probability  Distributions 


The  Univariate  LogNormal 

If  X  is  a  nonnegative  random  variable  and  Y  =  In  X  has  a  normal  distribution  with 
Py  =  E(Y)  and  o^  =  Var(Y) 


then  X  is  said  to  be  lognormally  distributed,  that  is 

X-LogNCpyOy) 

with  density  function 


fx(x)  = 


1 


~j2t 


for  X  >  0 


7t  OyX 

0  otherwise 


(B-1) 


Equation  B-1  is  referted  to  as  the  lognormal  distribution.  Important  properties  of 
f^ (x)  are  summarized  in  table  B- 1 . 


Table  B- 1  Properties  of  the  Lognormal  Distribution 


ftoperty 

Expression 

Iix=E(X) 

o|=  Var(X) 

-1) 

Mode(X) 

gt^Y-Oy 

Median(X) 

e^'Y 

B-1 


Theorem  B*l.  If  X  is  lognormally  distributed  with  mean  hx  and  variance  Gx 
that 


and 


(Mx)'* 

(iix)^+Ox 


Oy  =ln 


(Mx)^+gx 

(4x)^ 


The  Bivariate  LogNormal 

Let  Yi  =  InXi  and  Y2  =  lnX2  where  Xi  and  X2  are  random  variables  defined  on 
0  <  xi  <  »  and  0  <  X2  <  «> .  Define 

E(Yj)  =  iiY,  =iti  and  Var(Yi)  =  o^_  =0? 

for  i  =  1, 2.  Suppose  that  Y 1  and  Y  2  each  have  a  normal  distribution  and  that  the 
pair 

(Xi .  X2 )  ~  Bivariate  LogN  ((p, ,  P2 ).  (of .  o  2 .  p,  2 )) 
is  a  bivariate  lognoimal  distribution  with  density  function 


fXi.X2(*l>M)  =  i 


I  -^v 

- P  — e  2 

(27t)0, 02^1 -pf  2*1*2 


0<xi  <<»,0<X2  <«> 


(B-2) 


0  otherwise 


where 


P1.2  “Py,.Y2  "PlnXi,lnX2 


B-2 


1  Tlnxi-tiif  r Inxt  -4i  Y lnx2  -M2  U Inx;  -^2  t 

l-pf.2 11  J  ‘*^1  1  <^2  )[  02  J 


1,  r  (Px  )'‘  1.2  2  ,  [(Px  )^+<Jx 

PY=4i=~l*> - T — T~  and  OY=Oi=ln - - - = — ^ 

2  <^x,)^+4,j  ‘  (Mx.)^ 


for  i  =  1, 2. 


Theorem  B-2.  If  (X1.X2)- BivariateLogN((Hi.H2).(Oi 


^P,  20,02 


Proof: 

By  definition 


x,.x2  ro2  772 
Ve°'  -  I'Ve  2  -1 


p  ^  Cov(Xi.X2)  ^  E(XiX2)-E(Xi)E(X2)  ^ 

X,,Xj  ^x,^Xj  ®X,®Xj 


Since  Y,  =  InX,  and  Y2  =  10X2  we  can  write 

E(XiX2) =E(e^>e^2 )  =  E(e^'  ^^2 ) 

Recall  that  Yj  -NCjii.of)  for  i  =  1,  2.  Thus,  E(e^''''^2  jjs  recognized  as  a 
qtedal  evaluation  of  the  moment  generating  function  of  a  bivariate  normal,  which 
is 

M(ti. t2 )  =  E(e‘>^> )  =  r  Te'-y- f(y , , y2 )dyidy2 

_  *?  +2Py,  Y20i®2*I*2  +02*2  ) 


fOT  some  real  t,  and  t2.  Therefore, 


B-3 


To  determine  the  remaining  terms  in  equation  (B-3),  we  note  that  for  r  >  Othe 
moments  of  X  are 

=  (B^) 

Thus. 

E(Xj)  =  e^‘'^^®' 

E(X2)  =  e*'^'^^°> 

Var(X,)  =  E(X?)-(E(X,))2 

_^24i+2of  _g2M,+o2 

Var(X2 )  =  E(X^ )  -(E(X2 ))^  =  e  ^^*2  )2 


_g 2^2 +2^2  _e^^*2‘^°2 


Substituting  into  equation  B-3,  we  have 


X1.X2 


E(X,X2)-E(Xi)E(X2) 


<^XiOX2 

.2. 


^(^l+ir2)+i(Oi +2Py,  Y20i02+02>  )(e>*2+J°2  ) 

^^t’^2  ^^2m+2af  _g2M,+of  ^^2iX2+2al  _g2p2- 


?+o? 


which  can  be  facttxed  as: 


g(p,+p,)+Ka?+<ij)^gP,^a,a, 
g(P,+P,)+i(0f+0j)^g0f  _j^gO*  _j 


where  Pi  2  "Pyi,Y2  ~PinXi.inX2 


B4 


Thus, 


Tliadbre 


1  J_  -(lnxi-M,)2/2aJ 


Q(x,.X2)dX2 


KOl  X, 

[y^nOiXi 
_  1  ^-^[(Inxi-Ui  /a^  ] 

■yjln  OlX| 

since  the  integrand  is  the  probability  density  ftinction  of  a  LogN(b,02(l  -Pi,2)) 
random  variable. 


To  compute  f2(X2).  the  density  ftinction  fx,.X2  (xi.X2)‘s  factored  as 


fXi.X2^*l'*2)-Q  (Xl.»2)j^ 


1  1  -(lnX2-H2)^/202 


JIO2X2 


(B-7) 


viliere 


and 


Q  (Xl.X2)  = 


y/2n(ai^l-pi2)xi 


-(Inxi-b*  )2/2of(l-pf2) 


Thereftxe 


b*=M.i+^pj2(lnx2-ii2) 

02  ’ 


k~Q*(Xi.X2)dXi 


_  I  1  1  ^-(lnX2-P2)^^2g2 

[V2j102  X2 
_  1  i  ^,-(ln  X2  -M2  )* 

V2n02  X2 

*2  2 

since  the  integrand  is  the  probability  density  ftinction  of  a  LogN(b  ,02  (1  -pi,2)) 
random  variable. 


Theorem  B-4.  If  (Xj .X2 )  -  Bivariate LogN ((Mi ,^2  )•  'O  2  •  Pi  2 

Xilx2  ~LogN(Mi+-^Pi,2(lnx2  -M2).<J?0-pf.2)) 
O2 

X2 j  xi  -  Lx)gN(M2  +  ^Pi.2 (In  X,  -  M, ) .  0^(1  -  pjj ) ) 

Ol 


Proofi 

By  definition 


fx..X,(Xt»X2)  IV2WO2X2 _ 

~  f2(X2)  "  ^ 

yjln  02  ^2 


fx,|xj<*l)  =  Q*<’^l-*2) 

thus,  from  B-7 

vdne 

Xi(x2-LogN(b*,0?(I-pf2)) 

b  =Mi+^Pi2(*®*2~P2) 

02  ’ 

Sinfilaily 

(  I  ^-if(lnx,-H,)’ 

fx.,x,  (Xl,X2) 

[^j2n  01  Xj 

1  ^-il(lnx, 

i/2n0iXi 

Q(Xi,X2) 


thus,  from  B'6 


^X2|x,<^2)  =  Q(Xi.X2) 
Xjjxi  ~LogN(b,o§(l-pf  2)) 


where 

<j, 

Theorem  B-5.  If (Xl.X2)~BivariateLogN(0l,.^2).(Ol.oVp^,2))  *ben 
RXjIx,)-  xf ’ 

Vxr(X  j|  X,)  =  ’e“(e'  - 1) 

E(X,|xj)  =  ’ 

Var(Xi|x2)  =  e'"  (e*’  - 1) 

where 

z  =  o^(l-pf2)  and  z*  =  of(l-pf2) 

Proof: 

Theorem  B-4  proved  that 

X2j  xi  -  LogN(p2  +  ^Pi.2(ln  xi  -  Pi ) .  <T§(1  -  pf  2 ) ) 
Therefore,  fiom  table  B-1 


and 


H2+'^Pj,2(lnxi-(ii  (t“P?,2 ) 

E(X2|xi)  =  e 

p-Pl.2  P2~Pl.2Pl+io2<‘-Pl2> 


2(jl2+^^1.2(lnXi-Hi))  2  2  2,,  -2  , 

Var(X2|xi)=e 


-1) 


Iheotem  B-4  also  proved  that 


B-8 


X|{x2  “LogN(4i  + — 2(lnX2  -P2)*®1  (^~Pl  2)  ) 
02 

Therefore,  firom  table  B-1 

^  1  +— *'Pi,2  (ta  *2  2  )+4'0?  ( l~P?,2  ) 

E(Xilx2)  =  e 


=  v°2 


^Pl.2  m-%l.2P2+i°?<‘~Pf.2> 


2(Pi+^l-Pi.2(*nx2-M2))  _2,,__2  .  rt^n-o-  1 

Var(X,|x2)  =  e  e°'<‘ 


Theorem  B-6.  If(Xi,X2)~BivariateLogN((p|.p2).(Oi.tT2.Pi2)) 
Meclian(X2|xi)  =  x®'**''^e**^ 

Mode(X,|x,)  =  x^'-^e’‘=-^'-^‘“-‘’^‘'-‘’u‘ 
Medi».(X,|x,).xf'’'’=.'‘'-§‘'''^'‘' 


Pnxfi 


From  theorem  B-4  and  table  B-1,  it  follows  that 


M«..n<X,|xO  =  /^*^"“""-‘'"=xF^-e-^-t'u^. 

Mode(X2lxi)  =  e  ®'  ’ 

^  ?Pl.2  P2-ifPl.2Pl-®2<'-pf.2^ 


B-9 


2.0,,  u,_2p,  n, 

Median(Xi|x2)  =  e  =x^  ‘-^e 

■*^1  2  *2 2  )-of  (l-pf  2  ) 

Mo<le(X,|x2)  =  e  ®2  • 


_,^pu/«-S<‘i.2‘^2-or(‘-pr.2) 

*  ^2  ^ 


Property  B-1.  If  (Xi,X2)~BivariateLogN((Pi.H2).(o?.02'Pi  2^^  then  the 
cooditioaal  coefficient  of  dispasion  D  is 

Tv.  t/V  1^  \  »'  ' 


%|i2  E(Xijx2) 

E(X2|x,)  ^ 


'*^*®**  *^Xi|x2  are  the  cumulative  distributions  of  fx,|x2  ^X2|x,  • 

TMs  condlary  is  stated  widxxit  proof  since  it  directly  follows  from  theorem  B-S. 


B-iO 


